OPTIMAL HIGH-SPEED MULTI-RESOLUTION RETRIEVAL METHOD ON LARGE
CAPACITY DATABASE

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to an optimal high-speed
multi-resolution retrieval method on a large capacity database,
and more particularly to a technique for deriving an inequality
that allows an accurate and rapid retrieval of desired
information from a database, and for implementing an optimal
high-speed information retrieval using the derived inequality.

Description of the Related Art 

In order to search for the best match to a query based on 
a similarity measure, an exhaustive search should be performed 
literally for all data contained in a database. However, 
straightforward exhaustive search algorithms require a large 
quantity of calculation. Thus, a variety of high-speed search 
algorithms have recently been proposed. 

Berman and Shapiro have proposed introducing the triangle
inequality so as to remove, from the retrieval procedure,
candidates that cannot be determined as the best match(es). To
reduce the additional calculation quantity, they have also
proposed simultaneously using diverse distance measures and
representative data called "key data". However, this method
involves a considerable variation in retrieval speed depending
on the "key data", and exhibits insufficient speed performance
with large capacity databases.

Recently, Berman and Shapiro have also proposed the
application of a data structure called a "Triangle Trie" to
achieve an improvement in performance. In this method, however,
there is still a problem in that the retrieval speed is
considerably influenced by the tree depth and the threshold
value of the "key data".

Meanwhile, Krishnamachari and Mottaleb have proposed a
cluster-based indexing algorithm in which diverse data
contained in a database are partitioned into clusters in such a
fashion that each cluster contains data having similar
features, in accordance with an architectural clustering
scheme.

In accordance with the cluster-based indexing algorithm, 
it is possible to remarkably reduce the quantity of calculation 
because query data is not compared with all data contained in a 
database, but compared with a part of the data in a retrieval 
procedure in accordance with the clustering scheme. 

In particular, the cluster-based indexing algorithm is
suitable for large capacity databases in that the number of
comparisons to obtain a desired retrieval accuracy is not
linearly proportional to the capacity of the database.

Fig. 1 is a schematic diagram illustrating problems
involved in conventional cluster-based search algorithms.
Referring to Fig. 1, the second cluster is selected as a
candidate because its center $C_2$ is nearest to the query Q. In
accordance with the illustrated search algorithm, an element
$X_2$ in the second cluster is selected as the best match, based
on the distance of each element belonging to the second cluster
from the query Q. However, the actual best match is the element
$X_8$ of the first cluster.

The reason why such a problem occurs is that the center
of the cluster to which the actual best match belongs is not
always nearest to the query Q. To address this, a method for
simultaneously searching several nearby clusters has been
proposed. However, this method inherently cannot ensure an
optimal retrieval.

Also, the conventional cluster-based search algorithms,
which cannot ensure an optimal retrieval, have a drawback in
that they cannot provide a retrieval speed sufficiently rapid
to obtain a satisfactory retrieval accuracy.



SUMMARY OF THE INVENTION

Therefore, the present invention has been made to solve
the above mentioned problems, and an object of the invention is
to provide an optimal high-speed multi-resolution retrieval
method on a large capacity database for deriving an inequality
capable of accurately distinguishing effective clusters from
ineffective clusters, and implementing an optimal high-speed
information retrieval using the derived inequality.

Another object of the invention is to provide an optimal
high-speed multi-resolution retrieval method on a large
capacity database for deriving an inequality based on a
multi-resolution data structure for high-speed processing, and
implementing an optimal high-speed multi-resolution retrieval
using the derived inequality.

In accordance with the present invention, these objects
are accomplished by providing an optimal high-speed
multi-resolution retrieval method on a large capacity database,
comprising the steps of: partitioning all data contained in a
database into a desired number of clusters each composed of
data having similar features; deriving the lower bound of the
distance between each cluster and a query, removing clusters
having no possibility to be determined as the best matches, and
searching, for best matches, data in clusters having the
possibility to be determined as the best matches; and deriving
an inequality property based on a multi-resolution data
structure for reducing unnecessary feature matching computation
involved in a search procedure to reduce a large quantity of
calculation.

In accordance with this method, it is possible to 
accurately search not only for a single best match, but also 
for a plurality of more-significant best matches. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The above objects, and other characteristics and 
advantages of the present invention will become more apparent 
after a reading of the following detailed description when 
taken in conjunction with the drawings, in which: 

Fig. 1 is a schematic diagram illustrating problems
involved in conventional cluster-based search algorithms;

Fig. 2 is a schematic diagram illustrating the distance
inequality property for an arbitrary cluster in accordance with
the present invention;

Fig. 3 is a schematic diagram illustrating the
multi-resolution data structure of a brightness histogram X
having $2^L$ bins;

Fig. 4 is a diagram schematically illustrating a minimum 
distance arrangement of M more-significant best matches; and 

Fig. 5 is a schematic diagram illustrating an example in
which an erroneous determination for best matches is made.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

Now, the configuration and effects of a preferred 
embodiment of the present invention will be described in detail 
with reference to the annexed drawings. 

Fig. 2 is a schematic diagram illustrating the distance
inequality property for an arbitrary cluster in accordance with
the present invention.

Prior to a description of the distance inequality
property for an arbitrary cluster in accordance with the present
invention, the clustering procedure for a database will be
described.

In accordance with the clustering procedure, the database 
is first divided into a predetermined number of clusters, that 
is, K clusters, using a MacQueen K-means clustering method, in 
order to allow data having similar features to compose one 
cluster. 

Here, the image data features may include information 
such as color, texture, and shape. In the case of audio data, 
information such as pitch may be usable for data features. Each 
of the K clusters has a mean center thereof. 

Since the computation required in the clustering
procedure is conducted irrespective of an actual retrieval, the
time taken to conduct the clustering procedure is not included
in the retrieval time. The clustering for the database is
carried out as follows:

Step 1) The number of clusters, K (K < N), is determined,
where N is the number of data contained in the database.

Step 2) The features of the cluster centers, $C_1$, $C_2$,
..., and $C_K$, are initialized. K data are arbitrarily selected
from the database as the initial cluster centers. For an
efficient initialization, the minimum distance between two
cluster centers is not to be less than a certain threshold
value.

Step 3) For the data other than the data selected as the
cluster centers, their nearest cluster centers are determined.
Each of these data is included in the cluster associated with
its nearest cluster center. Thereafter, each cluster center is
updated, based on the following expression:

[Expression 1]

$$C_k = \frac{1}{n(\Phi_k) + 1}\left[\, n(\Phi_k)\, C_k + X_i \,\right]$$

where, "$X_i$" represents the current element to be added to
the current cluster, "$\Phi_k$" represents the current cluster,
and "$n(\Phi_k)$" represents the number of elements belonging to
the current cluster $\Phi_k$.

Step 4) The third step is repeated for all elements.
Thus, a cluster center set $\Pi^0 = \{C_1, C_2, \ldots, C_K\}$ is
finally derived.
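
The clustering steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation of the invention: the function names `l1` and `macqueen_kmeans`, the `min_sep` parameter (standing in for the threshold of Step 2), the use of the L1 distance during clustering, and the random seeding are all hypothetical choices made for the example.

```python
import random


def l1(x, y):
    """L1 distance between two equal-length feature vectors."""
    return sum(abs(a - b) for a, b in zip(x, y))


def macqueen_kmeans(data, k, min_sep=0.0, seed=0):
    """One-pass MacQueen-style clustering following Steps 1 to 4.

    Returns (centers, clusters); clusters[j] lists the indices of the
    data assigned to center j.
    """
    rng = random.Random(seed)
    order = list(range(len(data)))
    rng.shuffle(order)
    # Step 2: pick k initial centers whose mutual distance is >= min_sep.
    seeds = []
    for idx in order:
        if all(l1(data[idx], data[s]) >= min_sep for s in seeds):
            seeds.append(idx)
        if len(seeds) == k:
            break
    if len(seeds) < k:
        raise ValueError("min_sep is too large to find k initial centers")
    centers = [list(data[s]) for s in seeds]
    clusters = [[s] for s in seeds]
    # Steps 3 and 4: assign every remaining datum to its nearest center
    # and update that center as a running mean (Expression 1).
    for idx in order:
        if idx in seeds:
            continue
        x = data[idx]
        j = min(range(k), key=lambda c: l1(x, centers[c]))
        n = len(clusters[j])
        centers[j] = [(n * c + v) / (n + 1) for c, v in zip(centers[j], x)]
        clusters[j].append(idx)
    return centers, clusters
```

Because each center is updated as a running mean, it equals the mean of its members once all elements have been processed.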
Now, a scheme for solving the problems inherent in the
conventional cluster-based search algorithm shown in Fig. 1
will be described with reference to Fig. 2.

First, an initial minimum distance $d_{\min}$ is derived for
the cluster nearest to a query Q, as expressed by the following
expression:

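[Expression 2]

$$d_{\min} = \min_{X_i \in \Phi_{k_{\min}}} d(X_i, Q)$$

In Expression 2, the nearest cluster center $C_{k_{\min}}$, which
identifies the cluster $\Phi_{k_{\min}}$, is given by the
following expression:

[Expression 3]

$$C_{k_{\min}} = \arg\min_{C_k \in \Pi^0} d(C_k, Q)$$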

In Expressions 2 and 3, "$d(X, Y)$" represents the
$L_1$-distance between two features X and Y. In accordance with
the conventional algorithm shown in Fig. 1, the initial minimum
distance $d_{\min}$ corresponds to "$d(X_2, Q)$". Thereafter, the
element furthest from the cluster center is determined in each
of the clusters other than the cluster $\Phi_{k_{\min}}$
associated with the initial minimum distance $d_{\min}$. The
distance of the furthest element from the cluster center in
each cluster, $\delta_k$, is defined as follows:



[Expression 4]

$$\delta_k = \max_{X_i \in \Phi_k} d(X_i, C_k)$$

The distance $\delta_k$ of each cluster is calculated and
stored in advance. Based on the initial minimum distance
$d_{\min}$ and $\delta_k$ of each cluster, it is then determined
whether or not it is necessary for the current cluster to be
searched in order to achieve an optimal retrieval. For this
determination, Inequality Property 1 expressed by the following
expression is used:

[Expression 5]

$$\text{If } d(C_k, Q) - \delta_k > d_{\min}, \text{ then } \min_{X_i \in \Phi_k} d(X_i, Q) > d_{\min}$$

Expression 5 expressing Inequality Property 1 can be
proved as follows. Let $X_{i_{\min}}$ be the element of the
cluster $\Phi_k$ nearest to the query Q:

[Expression 6]

$$X_{i_{\min}} = \arg\min_{X_i \in \Phi_k} d(X_i, Q)$$

In accordance with the triangle inequality, the following
inequality holds for the element defined by Expression 6:

[Expression 7]

$$d(X_{i_{\min}}, Q) \ge d(C_k, Q) - d(X_{i_{\min}}, C_k)$$
Similarly, the following inequality can be derived from
Expression 4:

[Expression 8]

$$\delta_k = \max_{X_i \in \Phi_k} d(X_i, C_k) \ge d(X_{i_{\min}}, C_k)$$

Using Expressions 7 and 8, the following inequality can
be obtained:

[Expression 9]

$$d(X_{i_{\min}}, Q) \ge d(C_k, Q) - d(X_{i_{\min}}, C_k) \ge d(C_k, Q) - \delta_k$$

If $d(C_k, Q) - \delta_k > d_{\min}$, the following expression is
then established:

[Expression 10]

$$d(X_{i_{\min}}, Q) = \min_{X_i \in \Phi_k} d(X_i, Q) > d_{\min}$$

Thus, Expression 5 is proved.

"$d(C_k, Q) - \delta_k$" in Inequality Property 1 is the lower
bound of the distance between the query Q and any element in
the current cluster $\Phi_k$.

If "$d(C_k, Q) - \delta_k$" is more than "$d_{\min}$", this means
that there is no element spaced apart from the query Q by a
distance less than "$d_{\min}$" in the current cluster $\Phi_k$.
Accordingly, it is unnecessary to take the current cluster into
consideration.

In such a fashion, therefore, it is possible to reliably
remove all ineffective clusters by applying Inequality
Property 1.
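
As a hedged illustration of Inequality Property 1, the following Python sketch computes the stored radius of Expression 4 and applies the removal test of Expression 5. The helper names `l1`, `cluster_radius`, and `can_skip_cluster` are hypothetical and are not taken from the original text.

```python
def l1(x, y):
    """L1 distance between two equal-length feature vectors."""
    return sum(abs(a - b) for a, b in zip(x, y))


def cluster_radius(center, members):
    """delta_k of Expression 4: distance from the center to its furthest member."""
    return max(l1(x, center) for x in members)


def can_skip_cluster(center, radius, query, d_min):
    """Inequality Property 1: True when no member can be closer than d_min."""
    return l1(center, query) - radius > d_min
```

For example, with a one-dimensional query at 4.4 and a current minimum distance of 0.6, a cluster with center 2.0 and radius 2.0 has lower bound 0.4 and must still be searched, whereas a cluster with center 21.0 and radius 1.0 has lower bound 15.6 and can be removed.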

However, the procedure for determining the nearest
cluster and the procedure for determining the best match still
require a considerable quantity of calculation. In order to
reduce this calculation quantity, an optimal retrieval method
is proposed in accordance with the present invention. According
to this optimal retrieval method, another inequality property
is derived, based on a multi-resolution data structure. Based
on this inequality property, an optimal high-speed retrieval
can be achieved.

Fig. 3 is a schematic diagram illustrating the
multi-resolution data structure of a brightness histogram X
having $2^L$ bins.

For the convenience of description, it is assumed that
the multi-resolution data structure illustrated in Fig. 3 is
associated with a normalized histogram having B ($B = 2^L$)
bins. The multi-resolution data structure of the histogram X
may be defined by a histogram sequence
$\{X^0, \ldots, X^l, \ldots, X^L\}$.

Here, X corresponds to $X^L$. The histogram $X^l$ has $2^l$
bins. This histogram $X^l$ is obtained by reducing the resolution
of the histogram $X^{l+1}$ by 50% (1/2).
Each bin value of the histogram at the current level is
obtained by summing together the values of two adjacent bins in
the histogram at the next upper level. For example, assuming
that $X^l(m)$ represents the value of the m-th bin in the
histogram $X^l$, this bin value $X^l(m)$ can be derived as
follows:

[Expression 11]

$$X^l(m) = X^{l+1}(2m - 1) + X^{l+1}(2m), \quad 1 \le m \le 2^l$$
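
A minimal Python sketch of the multi-resolution structure defined by Expression 11 is given below. The function name `build_pyramid` is hypothetical, and the code uses 0-based bin indices, so the pair (2m-1, 2m) of the expression becomes (2m, 2m+1).

```python
def build_pyramid(hist):
    """Return [X^0, X^1, ..., X^L] for a histogram with 2**L bins."""
    n = len(hist)
    if n == 0 or n & (n - 1):
        raise ValueError("number of bins must be a power of two")
    levels = [list(hist)]
    while len(levels[-1]) > 1:
        fine = levels[-1]
        # Expression 11 with 0-based indices: coarse[m] = fine[2m] + fine[2m+1].
        levels.append([fine[2 * m] + fine[2 * m + 1]
                       for m in range(len(fine) // 2)])
    levels.reverse()  # index l now holds the histogram with 2**l bins
    return levels
```

For an 8-bin histogram `[1, 2, 3, 4, 5, 6, 7, 8]`, the levels are `[36]`, `[10, 26]`, `[3, 7, 11, 15]`, and the original histogram.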

Now, another inequality property, that is, Inequality
Property 2, used for an optimal high-speed retrieval in a
multi-resolution feature space will be described in detail.
Inequality Property 2 can be expressed by the following
expression:

[Expression 12]

$$d(X, Y) \equiv d^L(X, Y) \ge d^{L-1}(X, Y) \ge \cdots \ge d^l(X, Y) \ge \cdots \ge d^1(X, Y) \ge d^0(X, Y)$$

where, "$d^l(X, Y)$" represents the $L_1$-distance between two
histograms X and Y at level l, that is, "$d(X^l, Y^l)$".

Expression 12 expressing Inequality Property 2 can be
proved as follows:

The $L_1$-distance $d^{l+1}(X, Y)$ between two histograms X and
Y at level "l+1" can be derived as follows:

[Expression 13]

$$d^{l+1}(X, Y) = \sum_{m=1}^{2^{l+1}} \left| X^{l+1}(m) - Y^{l+1}(m) \right|$$

where, each of the histograms $X^{l+1}$ and $Y^{l+1}$ has
$2^{l+1}$ bins, and $X^{l+1}(m)$ represents the value of the m-th
bin in the histogram $X^{l+1}$.

Meanwhile, the $L_1$-distance $d^l(X, Y)$ at level l can be
expressed as follows:

[Expression 14]

$$d^l(X, Y) = \sum_{m=1}^{2^l} \left| X^l(m) - Y^l(m) \right|$$

Since $|A| + |B| \ge |A + B|$, the following expression is
established:

[Expression 15]

$$\sum_{m=1}^{2^l} \left( \left| X^{l+1}(2m-1) - Y^{l+1}(2m-1) \right| + \left| X^{l+1}(2m) - Y^{l+1}(2m) \right| \right) \ge \sum_{m=1}^{2^l} \left| X^{l+1}(2m-1) - Y^{l+1}(2m-1) + X^{l+1}(2m) - Y^{l+1}(2m) \right|$$
Based on Expressions 11, 13, 14 and 15, the following
expression can be derived:

[Expression 16]

$$d^{l+1}(X, Y) = \sum_{m=1}^{2^l} \left( \left| X^{l+1}(2m-1) - Y^{l+1}(2m-1) \right| + \left| X^{l+1}(2m) - Y^{l+1}(2m) \right| \right) \ge \sum_{m=1}^{2^l} \left| X^{l+1}(2m-1) - Y^{l+1}(2m-1) + X^{l+1}(2m) - Y^{l+1}(2m) \right| = d^l(X, Y)$$

Applying Expression 16 successively from level L down to
level 0, it can be found that Expression 12 is established.
Thus, Inequality Property 2 is proved.

Inequality Property 2 means that when "$d^l(X, Y)$" is more
than a particular value, "$d^L(X, Y)$" is always more than the
particular value.
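
Inequality Property 2 can be checked numerically with a short Python sketch. The helper names `l1`, `build_pyramid`, and `level_distances` are hypothetical; the code only computes the per-level L1 distances so that their non-decreasing order from level 0 to level L can be observed.

```python
def l1(x, y):
    """L1 distance between two equal-length histograms."""
    return sum(abs(a - b) for a, b in zip(x, y))


def build_pyramid(hist):
    """Return [X^0, ..., X^L] for a histogram with 2**L bins."""
    levels = [list(hist)]
    while len(levels[-1]) > 1:
        fine = levels[-1]
        levels.append([fine[2 * m] + fine[2 * m + 1]
                       for m in range(len(fine) // 2)])
    levels.reverse()
    return levels


def level_distances(x, y):
    """Return [d^0(X, Y), d^1(X, Y), ..., d^L(X, Y)]."""
    return [l1(a, b) for a, b in zip(build_pyramid(x), build_pyramid(y))]
```

For the 4-bin histograms `[1, 2, 3, 4]` and `[4, 3, 2, 1]`, the distances at levels 0, 1, and 2 are 0, 8, and 8, so a candidate whose level-1 distance already exceeds the current minimum can be rejected without evaluating level 2.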

The distance calculation at an upper level requires an 
increased computation quantity as compared to that at a lower 
level. Also, it is possible to remove an increased number of 
ineffective candidates at the lower level, as compared to the 
upper level. When such properties are applied to a search 
procedure, it is possible to considerably reduce the 
calculation quantity required for the search procedure. 

It is assumed that "N" represents the number of data
contained in a database I ($I = \{I_1, \ldots, I_i, \ldots, I_N\}$),
and "$\Omega^0$" represents a set of the features of the data
($\Omega^0 = \{X_1, \ldots, X_i, \ldots, X_N\}$). The
multi-resolution features of each data are calculated and
stored in advance.
The high-speed multi-resolution exhaustive search
algorithm (MSA_S) based on Inequality Property 2 can be
summarized as follows:

Step 1) The multi-resolution structure of a query Q is
derived.

Step 2) The initial minimum distance $d_{\min}$ is set to be
infinite.

Step 3) Respective values of "i" and "l" are set to be 1.

Step 4) If "i" is more than "N", the procedure proceeds to
step 7. If l = L, the procedure proceeds to step 6.

Step 5) The value of "$d^l(X_i, Q)$" is derived. If
"$d^l(X_i, Q)$" is more than "$d_{\min}$", the current candidate
$X_i$ is removed, and respective values of "i" and "l" are
updated with "i + 1" and "1". If not, the value of "l" is
updated with "l + 1". Subsequently, the procedure returns to
step 4.

Step 6) If "$d^L(X_i, Q)$" is more than "$d_{\min}$", the current
candidate $X_i$ is removed. If not, "$d_{\min}$" is updated with
"$d^L(X_i, Q)$". Respective values of "i" and "l" are updated
with "i + 1" and "1". Thereafter, the procedure returns to
step 4.

Step 7) The data having the final "$d_{\min}$" is selected as
the best match.
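
The MSA_S steps above can be sketched in Python as follows. This is an illustrative sketch only: the names `l1`, `build_pyramid`, and `msa_s` are hypothetical, the database pyramids are assumed to have been computed in advance, and the explicit step counters are replaced by nested loops with the same effect.

```python
import math


def l1(x, y):
    """L1 distance between two equal-length histograms."""
    return sum(abs(a - b) for a, b in zip(x, y))


def build_pyramid(hist):
    """Return [X^0, ..., X^L] for a histogram with 2**L bins."""
    levels = [list(hist)]
    while len(levels[-1]) > 1:
        fine = levels[-1]
        levels.append([fine[2 * m] + fine[2 * m + 1]
                       for m in range(len(fine) // 2)])
    levels.reverse()
    return levels


def msa_s(query, database_pyramids, d_min=math.inf):
    """Return (best_index, d_min); best_index is None if no candidate
    has a distance that does not exceed the initial d_min."""
    q = build_pyramid(query)          # Step 1
    top = len(q) - 1
    best = None
    for i, pyr in enumerate(database_pyramids):
        d = 0.0
        for level in range(1, top + 1):
            d = l1(pyr[level], q[level])
            if d > d_min:
                break                 # Step 5: rejected at this level
        else:
            # Step 6: the candidate survived level L, so d is d^L(X_i, Q).
            d_min, best = d, i
    return best, d_min
```

Level 0 is skipped because the single-bin histograms of normalized data are identical, which mirrors Step 3 starting at level 1.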
As apparent from the above description, the
multi-resolution features of each data contained in the
database are calculated in advance, and then stored. However,
the quantity of calculation for the multi-resolution features
of query data should be taken into consideration because those
multi-resolution features must be obtained within the search
time.

In the case of, for example, normalized brightness 
histogram features, only 254 additions are required in 
obtaining a desired multi-resolution histogram because the 
number of levels is 8. Accordingly, the quantity of calculation 
for the multi-resolution brightness histogram may be 
negligible, taking into consideration the fact that 511 
additions and 256 absolute value computations are required for 
one matching procedure. 
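
The operation counts quoted above can be reproduced with a few lines of Python. This sketch rests on two assumptions that are interpretations rather than statements of the original text: the single-bin level 0 is not computed, and the 256 subtractions of one full-resolution match are counted as additions together with the 255 accumulations.

```python
levels = 8                    # 256 = 2**8 bins at full resolution
bins = 2 ** levels

# Building levels 7 down to 1 costs one addition per output bin.
pyramid_additions = sum(2 ** l for l in range(1, levels))

# One full-resolution L1 match: 256 subtractions plus 255 accumulations.
match_additions = bins + (bins - 1)
match_abs = bins

print(pyramid_additions, match_additions, match_abs)
```

Under these assumptions the script prints `254 511 256`, which agrees with the figures in the paragraph above.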

Meanwhile, although it is necessary to use an additional 
memory for storing multi-resolution histograms, such a memory 
addition may be negligible because the size of each histogram 
is considerably smaller than that of associated data. The same 
conditions are applied to other features. 

Now, a new cluster-based multi-resolution search algorithm
(CMSA) for achieving an optimal high-speed information
retrieval, using the optimal cluster removal condition
according to Inequality Property 1 and the MSA_S based on
Inequality Property 2, will be described.
Once a query is given, the cluster center nearest to the
query is first determined in accordance with MSA_S. The distance
$d_{\min}$ between the query and the initial best match of the
cluster associated with the nearest cluster center is then
derived.

Thereafter, the best match(es) are determined by applying
MSA_S to the clusters expected to contain those best matches,
in accordance with the cluster removal condition based on
Inequality Property 1. Since the determination of the nearest
cluster center is carried out based on MSA_S, the value of
"$d^L(C_k, Q)$" is not necessarily available for every cluster
in the cluster removal procedure.

This is because, if "$d^{l_k}(C_k, Q)$" is more than "$d_{\min}$",
the distances $d^{l_k+1}(C_k, Q), \ldots, d^L(C_k, Q)$ at the
respective levels higher than the $l_k$-th level are not
calculated.

For this reason, there is a problem in that it is
necessary to calculate the values of "$d^L(C_k, Q)$" again in
order to apply Inequality Property 1, expressed by Expression
5, in the cluster removal procedure.

To avoid this, Inequality Property 1 is modified into
Inequality Property 1.1 using the relation
"$d(C_k, Q) = d^L(C_k, Q) \ge d^{l_k}(C_k, Q)$", as follows:

[Expression 17]

$$\text{If } d^{l_k}(C_k, Q) - \delta_k > d_{\min}, \text{ then } \min_{X_i \in \Phi_k} d(X_i, Q) > d_{\min}$$

where, $l_k \le L$.
In accordance with Inequality Property 1.1, where
"$d^{l_k}(C_k, Q) - \delta_k$" is more than "$d_{\min}$", it is
possible to remove the current cluster $\Phi_k$ without any loss.

On the other hand, where "$d^{l_k}(C_k, Q) - \delta_k$" is not
more than "$d_{\min}$", the current cluster $\Phi_k$ is searched
because the best match may be present in the current cluster
$\Phi_k$. For this determination, there is no additional
quantity of calculation because the values $d^{l_k}(C_k, Q)$ and
$\delta_k$ associated with each cluster are known in advance.

Two CMSAs based on the above mentioned inequality
properties are proposed according to the number of output best
matches.

The first algorithm is a CMSA_S adapted to output a single
best match, and the second algorithm is a CMSA_M adapted to
output a plurality of best matches.

The CMSA_S mainly involves three processing steps. In
accordance with this CMSA_S, "$C_{k_{\min}}$" is first determined
using MSA_S. The initial minimum distance $d_{\min}$ is then
derived from "$\Phi_{k_{\min}}$". Finally, MSA_S is applied to
candidate clusters selected in accordance with Inequality
Property 1.1, thereby determining the best match. The search
procedure according to CMSA_S can be summarized as follows:
Step 1) MSA_S is carried out on the cluster centers to
determine the cluster $k_{\min}$ having a minimum distance
$d'_{\min}$.

Step 2) MSA_S is applied to "$\Phi_{k_{\min}}$" under the
condition that the initial "$d_{\min}$" corresponds to
"$d'_{\min}$", thereby updating "$d_{\min}$" as follows:

[Expression 18]

$$d_{\min} = \min_{X_i \in \Phi_{k_{\min}}} d^L(X_i, Q)$$

Step 3-1) "k" is set to "1".

Step 3-2) If $k = k_{\min}$, "k" is updated with "k + 1". On
the other hand, if k > K, the procedure proceeds to step 3-4.

Step 3-3) If "$d^{l_k}(C_k, Q) - \delta_k$" is more than
"$d_{\min}$", the current cluster is removed. If not, "$d_{\min}$"
is updated by applying MSA_S to "$\Phi_k$". After "k" is updated
with "k + 1", the procedure returns to step 3-2.

Step 3-4) Data having the final "$d_{\min}$" is selected as
the best match.
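
The CMSA_S procedure can be sketched in Python as shown below. The sketch makes several assumptions that are not part of the original text: all function names are hypothetical, pyramids are built on the fly although the description stores them in advance, each cluster is given as a list of (identifier, histogram) pairs, and the search of the nearest cluster starts from an infinite minimum distance, which is a simplification of Step 2.

```python
import math


def l1(x, y):
    return sum(abs(a - b) for a, b in zip(x, y))


def build_pyramid(hist):
    levels = [list(hist)]
    while len(levels[-1]) > 1:
        fine = levels[-1]
        levels.append([fine[2 * m] + fine[2 * m + 1]
                       for m in range(len(fine) // 2)])
    levels.reverse()
    return levels


def msa_s(q_pyr, pyramids, d_min=math.inf):
    """Return (best_index, d_min, last) where last[i] is the distance at
    the last level evaluated for candidate i (d^{l_k} for centers)."""
    top = len(q_pyr) - 1
    best, last = None, []
    for i, pyr in enumerate(pyramids):
        d = 0.0
        for level in range(1, top + 1):
            d = l1(pyr[level], q_pyr[level])
            if d > d_min:
                break
        else:
            d_min, best = d, i
        last.append(d)
    return best, d_min, last


def cmsa_s(query, centers, radii, clusters):
    """Return (item_id, distance) of the single best match."""
    q = build_pyramid(query)
    # Step 1: nearest center, keeping d^{l_k}(C_k, Q) for every center.
    k_min, _, center_d = msa_s(q, [build_pyramid(c) for c in centers])
    order = [k_min] + [k for k in range(len(centers)) if k != k_min]
    best_id, d_min = None, math.inf
    for k in order:
        # Step 3-3: Inequality Property 1.1 removes ineffective clusters.
        if k != k_min and center_d[k] - radii[k] > d_min:
            continue
        pyrs = [build_pyramid(h) for _, h in clusters[k]]
        j, d, _ = msa_s(q, pyrs, d_min)
        if j is not None:
            best_id, d_min = clusters[k][j][0], d
    return best_id, d_min
```

In a situation like that of Fig. 1, where the best match lies in a cluster whose center is not the nearest one, the lower bound of that cluster does not exceed the current minimum distance, so the cluster is searched and the true best match is returned.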

In accordance with CMSA_M, "$C_{k_{\min}}$" is first determined
in the same fashion as that in CMSA_S. Thereafter, a minimum
distance arrangement shown in Fig. 4 and adapted to store
respective distance values of M more-significant best matches
is filled with those best matches in accordance with a rule to
be described hereinafter.

Fig. 4 is a diagram schematically illustrating a minimum
distance arrangement of M more-significant best matches.

If $n(\Phi_{k_{\min}}) \ge M$, M more-significant best matches
are filled in the arrangement in the ascending order, starting
from that having the lowest value.

If $n(\Phi_{k_{\min}}) < M$, respective distances of all elements
present in "$\Phi_{k_{\min}}$" are calculated. The calculated
values are stored in the arrangement in the ascending order,
starting from the lowest value. The remaining elements of the
arrangement, not filled with the calculated values, are filled
with the infinite value. Using a modified MSA_S, it is possible
to determine M more-significant best matches present in
"$\Phi_{k_{\min}}$". This modified algorithm is referred to as
"MSA_M". This MSA_M can be summarized as follows:

Step 1) The multi-resolution features of the query Q are
derived.

Step 2) All elements present in "$d_{\min}[\cdot]$" are
initialized with the infinite value.

Step 3) "i" and "l" are set to "1", respectively.

Step 4) If $i > n(\Phi_{k_{\min}})$, the procedure proceeds to
step 7. If l = L, the procedure proceeds to step 6.

Step 5) "$d^l(X_i, Q)$" is calculated. If
$d^l(X_i, Q) > d_{\min}[M - 1]$, the current candidate $X_i$ is
removed. Thereafter, "i" and "l" are updated with "i + 1" and
"1", respectively. Following this updating, the procedure
returns to step 4. If $d^l(X_i, Q) \le d_{\min}[M - 1]$, "l" is
updated with "l + 1". Following this updating, the procedure
returns to step 4.

Step 6) If $d^L(X_i, Q) > d_{\min}[M - 1]$, the current
candidate $X_i$ is removed. If not, "$d_{\min}[M - 1]$" is updated
with "$d^L(X_i, Q)$". Following this updating, "$d_{\min}[\cdot]$"
is arranged in the ascending order, starting from the lowest
value. Thereafter, "i" and "l" are updated with "i + 1" and
"1", respectively. Following this updating, the procedure
returns to step 4.

Step 7) Finally, M data left in "$d_{\min}[\cdot]$" are selected
as M most-significant best matches.
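
A minimal Python sketch of MSA_M follows. The names `l1`, `build_pyramid`, and `msa_m` are hypothetical, candidates are assumed to be given as (identifier, pyramid) pairs, and the minimum distance arrangement is represented as a sorted list of (distance, identifier) pairs padded with infinite distances.

```python
import bisect
import math


def l1(x, y):
    return sum(abs(a - b) for a, b in zip(x, y))


def build_pyramid(hist):
    levels = [list(hist)]
    while len(levels[-1]) > 1:
        fine = levels[-1]
        levels.append([fine[2 * m] + fine[2 * m + 1]
                       for m in range(len(fine) // 2)])
    levels.reverse()
    return levels


def msa_m(q_pyr, items, m, best=None):
    """Return the m smallest (distance, item_id) pairs in ascending order.

    `best` may carry over an arrangement from a previous call; it is
    updated in place and returned.
    """
    top = len(q_pyr) - 1
    if best is None:
        best = [(math.inf, None)] * m           # Step 2
    for item_id, pyr in items:
        d = 0.0
        for level in range(1, top + 1):
            d = l1(pyr[level], q_pyr[level])
            if d > best[-1][0]:
                break                           # Step 5: exceeds d_min[M-1]
        else:
            # Step 6: insert in ascending order and drop the largest entry.
            pos = bisect.bisect_right([b[0] for b in best], d)
            best.insert(pos, (d, item_id))
            best.pop()
    return best
```

Inserting the new distance at its sorted position and discarding the last entry has the same effect as overwriting `d_min[M - 1]` and re-sorting the arrangement.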

As mentioned above, the updating of "$d_{\min}[\cdot]$" is
carried out by filling "$d_{\min}[\cdot]$" while applying MSA_M to
"$\Phi_{k_{\min}}$", and then applying MSA_M to each of the
clusters selected from the remaining clusters in accordance
with Inequality Property 1.1.

Finally, data corresponding to "$d_{\min}[\cdot]$" are selected
as M more-significant best matches. However, even when the
above-mentioned search algorithm is used, the M more-significant
best matches are not always found accurately.

Fig. 5 is a schematic diagram illustrating an example in
which an erroneous determination for best matches is made.

Although $X_8$, $X_4$, and $X_2$ are selected as three
more-significant best matches in the case of Fig. 5, the actual
third best match is not $X_2$, but $X_9$. To address this, a
cluster removal condition relaxed from Inequality Property 1.1
may be derived by replacing "$d_{\min}$" with "$d_{\min}[M - 1]$".
That is, the following Inequality Property 1.2 may be derived:

[Expression 19]

$$\text{If } d(C_k, Q) - \delta_k > d_{\min}[M - 1], \text{ then } \min_{X_i \in \Phi_k} d(X_i, Q) > d_{\min}[M - 1]$$

When the above inequality property is employed as a
post-processing step in the algorithm proposed in accordance
with the present invention, it is always possible to accurately
search for M more-significant best matches. The final CMSA_M
using the above mentioned inequality properties can be
summarized as follows:
Step 1) The cluster $k_{\min}$ having the minimum distance
$d'_{\min}$ is searched for, as in step 1 of CMSA_S.

Step 2) If $n(\Phi_{k_{\min}}) \ge M$, M more-significant best
matches are searched for in accordance with MSA_M. Respective
distance values of the searched more-significant best matches
are stored in "$d_{\min}[\cdot]$". On the other hand, if
$n(\Phi_{k_{\min}}) < M$, $n(\Phi_{k_{\min}})$ distance values are
filled in "$d_{\min}[\cdot]$" in the ascending order, starting
from the lowest value. The remaining elements of the
arrangement are filled with the infinite value.

Step 3-1) "k" is set to "1".

Step 3-2) If $k = k_{\min}$, "k" is updated with "k + 1". On
the other hand, if k > K, the procedure proceeds to step 3-5.

Step 3-3) If $d^{l_k}(C_k, Q) - \delta_k > d_{\min}[0]$, the
cluster k is removed. After "k" is updated with "k + 1", the
procedure returns to step 3-2.

Step 3-4) If $d^{l_k}(C_k, Q) - \delta_k \le d_{\min}[0]$,
"$d_{\min}[\cdot]$" is updated by applying MSA_M to "$\Phi_k$".
After "k" is updated with "k + 1", the procedure returns to
step 3-2.

Step 3-5) "k" is set to "1".

Step 3-6) If it is determined that the cluster k has been
searched at step 3-4, "k" is then updated with "k + 1". If
k > K, the procedure proceeds to step 3-9.

Step 3-7) If $d^{l_k}(C_k, Q) - \delta_k > d_{\min}[M - 1]$, the
cluster k is removed. After "k" is updated with "k + 1", the
procedure returns to step 3-6.

Step 3-8) If $d^{l_k}(C_k, Q) - \delta_k \le d_{\min}[M - 1]$,
"$d_{\min}[\cdot]$" is updated by applying MSA_M to "$\Phi_k$".
After "k" is updated with "k + 1", the procedure returns to
step 3-6.

Step 3-9) M data corresponding to the final "$d_{\min}[\cdot]$"
are selected as the top M best matches.
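
The final CMSA_M, including the post-processing pass based on Inequality Property 1.2, can be sketched in Python as follows. This is an illustrative sketch under assumptions: the function names are hypothetical, pyramids are built on the fly rather than stored in advance, clusters are lists of (identifier, histogram) pairs, and the two passes over the clusters are expressed as one loop over the thresholds `d_min[0]` and `d_min[M - 1]`.

```python
import bisect
import math


def l1(x, y):
    return sum(abs(a - b) for a, b in zip(x, y))


def build_pyramid(hist):
    levels = [list(hist)]
    while len(levels[-1]) > 1:
        fine = levels[-1]
        levels.append([fine[2 * m] + fine[2 * m + 1]
                       for m in range(len(fine) // 2)])
    levels.reverse()
    return levels


def msa_m(q_pyr, items, m, best=None):
    """Top-m search over (item_id, histogram) pairs; see MSA_M."""
    if best is None:
        best = [(math.inf, None)] * m
    for item_id, hist in items:
        d = 0.0
        for lq, lx in zip(q_pyr[1:], build_pyramid(hist)[1:]):
            d = l1(lx, lq)
            if d > best[-1][0]:
                break
        else:
            pos = bisect.bisect_right([b[0] for b in best], d)
            best.insert(pos, (d, item_id))
            best.pop()
    return best


def cmsa_m(query, centers, radii, clusters, m):
    """Return the m best (distance, item_id) pairs in ascending order."""
    q = build_pyramid(query)
    # Step 1: nearest center, keeping the last distance evaluated per center.
    center_d, d_c, k_min = [], math.inf, None
    for k, c in enumerate(centers):
        d = 0.0
        for lq, lc in zip(q[1:], build_pyramid(c)[1:]):
            d = l1(lc, lq)
            if d > d_c:
                break
        else:
            d_c, k_min = d, k
        center_d.append(d)
    best = msa_m(q, clusters[k_min], m)          # Step 2
    searched = {k_min}
    # Steps 3-1 to 3-4 test against d_min[0]; steps 3-5 to 3-8 repeat the
    # scan over unsearched clusters with the relaxed bound d_min[M-1].
    for threshold_index in (0, m - 1):
        for k in range(len(centers)):
            if k in searched:
                continue
            if center_d[k] - radii[k] > best[threshold_index][0]:
                continue
            best = msa_m(q, clusters[k], m, best)
            searched.add(k)
    return best
```

In a case like that of Fig. 5, the first pass skips a cluster whose lower bound exceeds `d_min[0]`, and the second pass revisits it because the bound does not exceed `d_min[M - 1]`, so the true third best match is recovered.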

As apparent from the above description, the following
advantages are obtained in accordance with the optimal
high-speed multi-resolution retrieval method on a large
capacity database proposed by the present invention.

First, the method of the present invention can be used as
an important module of a search engine for any system used for
a high-speed optimal retrieval on a large capacity database,
for example, an image or video database.

Second, the method of the present invention is 
applicable to any multimedia database, which is capable of 
having a multi-resolution structure for image or audio data, 
to accurately and rapidly search the database for desired 
information. 